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ABSTRACT 

We study the potential of weak lensing surveys to detect clusters of galaxies, using a fast Parti- 
cle Mesh cosmological N-body simulation algorithm specifically tailored to investigate the statistics 
of these shear selected clusters. In particular, we explore the degree to which the radial positions 
of galaxy clusters can be determined tomographically, by using photometric redshifts of background 
source galaxies. We quantify the errors in the tomographic redshifts, Az = Ztomography — -Zreai, and 
study their dependence on mass, redshift, detection significance, and filtering scheme. For clusters 
detected with signal to noise ratio S/N > 4.5, the fraction of clusters with tomographic redshift errors 
|Az| < 0.2 is 45% and the root mean square deviation from the real redshifts is ((Az)^)^/^ = 0.41, 
with smaller errors for higher S/N ratios. A Tomographic Matched Filtering (TMF) scheme, which 
combines tomography and matched filtering, is introduced which optimally detects clusters of galaxies 
in weak lensing surveys. The TMF exploits the extra information provided by photometric redshifts 
of background source galaxies, neglected in previous studies, to optimally weight the sources. The 
efficacy and reliability of the TMF is investigated using a large ensemble of mock observations from 
our simulations and detailed comparisons are made to other filters. Using photometric redshift in- 
formation with the TMF enhances the number of clusters detected with S/N > 4.5 by as much as 
76%, and it increases the dynamic range of weak lensing searches for clusters, detecting more high 
redshift clusters and extending the mass sensitivity down to the scale of large groups. Furthermore, 
we find that coarse redshift binning of source galaxies with as few as three bins is sufficient to realize 
the gains of the TMF. Thus, the tomographic filtering techniques presented here can be applied to 
current ground based weak lensing data in as few as three bands, since two colors and magnitude prior 
are sufficient to bin source galaxies this coarsely. Cosmological applications of shear selected cluster 
samples are also discussed. 

Subject headings: cosmology: theory - methods: numerical - clusters: general - large scale structure 
of the universe - gravitational lensing 



1. INTRODUCTION 

Mapping the distribution of dark matter in the uni- 
verse is now one of the primary goals of observational 
cosmology. The traditional approach has been to survey 
the sky for a population of luminous objects, assuming 
a direct relationship between luminosity and dark mat- 
ter. While large galaxy surveys such as the 2dF Galaxy 
Redshift Survey ('CoUess et al. "2001') and the Sloan Dig- 
ital Sky Survey (York ct al. 2000) are perhaps the first 
examples that come to mind, future wide field X-ray 
llRnmer et al. Il2001[) and Sunvaev -Z'eldovich (SZ) sur- 
vevs l|Carlstrom. Holder, fc R,eesell2002t iKosow skv 2003) 
will usher in an era where maps of the universe will be 
available at multiple frequencies. Generically, all objects 
that emit radiation will be biased tracers of the underly- 
ing dark matter distribution. Hence, the interpretation 
of this data is limited by our ability to quantify the re- 
lationship between dark and luminous matter. 

Weak gravitational lensing, or the coherent distor- 
tion of images of faint background galaxies by the fore- 
ground matter distribution, provides a unique oppor- 
tunity to map the dark matter directly without mak- 
ing any assumptions about how baryons trace dark 
matter (see e.g., Bartelmann & Schneider (2001) or 
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iVan Waerbeke fc Mellieil 1)20031) for a review). Recently, 
weak lensing by large scale structure, or "cosmic shear" , 
has been indepe ndently detected by several groups (see 
Table 1 of lVan W aerbek e fc Melli er ( 2003) for a compila- 
tion of references). The primary focus of these observa- 
tions has been to measure the two point statistical prop- 
erties of the shear field, which probes the matter power 
spectrum in proj ection. However , as first suggested by 
iKaised l|1995D and lSchneideil 1)19961) , weak lensing surveys 
can also be used to detect individual mass concentra- 
tions, allowing one to construct a shear selected sample 
of dark halos. Typically these will be galaxy cluster size 
M > 10^^'^ Mq objects, as the finite number of back- 
ground sources and their intrinsic ellipticity place a lower 
limit on the size of a halo that can be detected. To date, 
there are only two such cases of spectroscopically con- 
firmed cluster detections from 'blank field' weak lensing 
obser vations (.Wit tman e t al. .200 L .2003.) . Although, le- 
centlv lSchirmer et al.l l)2003L 12004)) have used weak lens- 
ing to confirm several color-selected optical cluster can- 
didates. 

A shear selected sample of galaxy clusters would be of 
great astrophysical and cosmological interest. Probable 
biases in optical. X-ray, and SZ selected cluster samples 
with respect to richness, baryon fraction, morphology, 
and d ynamical state, would be revealed ( Hughes ct al] 
12004^. Furthermore, if there does exist a population of 
high mass to light clusters that has hitherto gone un- 
detected, this would cast serious doubt upon the "fair 
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sample" hypotheses (jWhite et al. 11199^ . invoked to de- 
termine the matter density parameter, fJm, from com- 
parisons of cluster baryon fraction to total cluster mass 
m easurements (see e.g. , fAUen. Schmidt. & Bridle (2003) 
or lVoevodkin fe Vik hlinin (2004) for recent examples). 

Perhaps a more contentious issue is the possible 
existence of a population of truly dark clusters. 
Recently, several groups have purportedly detected 
dark mass concent r ations from weak lensing ll Fi scherl 
19991 lErben e t al. 2000: Umctsu & Futamase '2000"; 



19991 ILevine. Schulz. fc Whit3l2fl(illMaiumdar fc Mohil 



Miralles et anr2002; Dahlc et al. .2003, though sec Erbcn 



et al. 2003). If these objects are actually 'dark' virialized 
mass concentrations as opposed to non-virialized ob- 
jects (Weinberg & Kamionkowski 2002) or projections 
of large scale stru cture dM ctzlcr. White. & Lokcn 
iWh ite. v a n Waerbek c. & Mackcv 
Eadmanabha n. Seliak fc PenI |2003f 
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Hamana. Takada. fc Yoshidal \200^ . this would pose 



serious challenges for our current structure formation 
paradigm and theories of galaxy formation. The absence 
of galaxies would require a complex form of feedback 
that suppresses galaxy formation in clusters. If such a 
dark cluster were X-ray or SZ faint, one would have to 
invoke some exotic mechanism to segregate dark matter 
from baryons on Mpc scales. 

It is well known that a large sample of clus- 
ters of galaxies out to moderate redshift z ~ 0.5 
can be used to impose stringent constraints on cos- 
Several authors have re- 
the number count distribu- 
from X-ray and SZ observa- 
tions ijHaiman. Mohr. fc Holder! I200H IW cUcr fc Battvi 
l2003|) , deep opti cal surveys l|Nemnan_£Lal_. ,2002)7 
and weak lensing iBartelmann. Perrotta. fc Baccig-ahipil 
"""" ' ' 3), 



mological parameters, 
cently suggested using 
tion o f galax y clustcrSj_ 



120021 [Weinberg fc Kamionkowskl2003|) . to probe the na- 
ture of the dark energy believed to be causing the accel- 
eration of the expansion of the universe. Besides probing 
the dark energy, these cluster samples would constrain 
the matt er density and ampHtude of mass fluctuations 
(see e.g., 'Bahcall fc Fan'iggg"; 'Holder. Haiman. fc Mohil 
ISdOI; Picrpaoh. Borgani. Scott, fc White 2003). In 
addition, the clustering of such cluster samples 
would provide ano ther means to measure the mat- 
ter power spectrum llXadros. Efstathiou. fc Daltonlll998l: 
iSchuecker et al.ll2001|) which would provide complimen- 
tary parameter constraints ijMaiumdar fc Mohr 2003) 
and might possibly allow for a measure ment of the an- 
gular diameter distance- redshift relation llCoorav et al. I 
1200 ID or baryon wiggles l)Hu fc Haimanll2003j) . 

Yet, a limitation of the 'baryon selection' in optical. 
X-ray, and SZ cluster samples, is that their selection 
function depends on observables (e.g., richness, veloc- 
ity dispersion, flux, temperature, SZ decrement) that 
serve as a proxy for mass. Attempts to model these 
mass-observable relationships depend on the uncertain 
astrophysics of galaxy formation and the state of baryons 
in clusters. While these can be determined empirically, 
scatter in the correlation be tween any two such ob serv- 
ables is significant (see e.g.. iKochanek et"aLll2003() : fur- 
thermore, these mass-observable relations surely evolve 
with redshift. The mass function of galaxy clusters is 
steep and exponentially sensitive to changes in limiting 
mass; moreover, errors in the mass-obs ervable relations 
mimic cosmological parameter changes ijViana fc Liddlj 



2003 ; Hi J 120031) Inclusion of uncertainties in the mass- 
observable relations significantly degrades constraints on 
cosmological parameters obtain ed from surveys that rely 
on a baryonic proxy for mass ( Levine. Schulz. fc Whitel 
l2002HMaiumdar fc Mo hr 2003). Alth ough consistency 
checks provide additional leverage l)Hull2005^ . expensive 
follow up mass measurements are in general required to 
empirically calibrate the relationships between light and 
mass (Majumdar fc Mohr 2003a; though see Lima fc Hu 
2004). 

In this regard, the great advantage of a shear se- 
lected sample of galaxy clusters is that the selection 
function can be predicted ab initio, given a model 
for structure formation. Because on the scales of 
interest for weak lensing, only gravity is involved 
and the mass is dominated by dark matter, pre- 
cision cosmological measurements will in principle 
be limited by instrumental systematics rather than 
unknown astrophysics. An accurate determination 
of the statistics of shear selected clusters from weak 
lensing requires N-body simulatio ns of cosmological 
structure formatio n (though see fKruse fc Schneide^ 
19991 iB artclma iiiTPerrotta. fc Baccigalupil 120021: 
Wein berg fc Kamionkowskin2003l for simple analvtical 
treatments), which brings us to the subject of this work. 

In this paper we consider the detection of shear 
selected clusters using fast Particle Mesh (PM) N- 
body simulations of cosmological structure formation 
specifically tailored to investigate the statistics of 
weak lensing by clusters. Similar numerical stud- 
ies c arried out by other g;roups l |Rgblinsk^^t_aL 
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have focused 

on the completeness and efficiency of weak lensing 
cluster searches. Devising an optimal filtering scheme 
to detect clusters has received little attention and 
furthermore, the additional information provided by 
photometric redshifts of background source galaxies has 
been neglected. In this paper we introduce the fast 
PM simulation algorithm, deduce the optimal cluster 
detection strategy, and investigate cluster tomography. 

A fundamental obstacle for weak lensing studies of the 
matter distribution is the lack of radial information. All 
of the matter along the line-of-sight to a distant source 
contributes to the lensing and so the distortion reflects a 
two dimensional projection of the dark matter. This lim- 
itation can be overcome by using photometric redshifts 
of the background source population, which will be dis- 
tributed in redshift, thus allowing for a tomographic re- 
construction of the three dimensional foreground matter 
distribution. 

A variant of this techni que has already been applied 
to real weak lensing data. iWittman et al. I l)2001ll2'003() 

tomographically determined the redshifts of two shear 
selected clusters at = 0.30 and Zd = 0.70 respectively, 
which were both confirmed by follow up sp ectroscopy to 
be at z = 0.28 and z — 0.68. Recently, iTavlor et alJ 
l)2004l) conducted a weak lensing analysis of the previ- 
ously known Abell superclustcr A901/2 at z = 0.16, and 
tomographically confirmed its redshift from weak lens- 
ing alone. They even constructed the first 3-D map of 
the dark matter potential of this cluster using the inver- 
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sion method introduced bvlTavlod J2001f) . iHu fc Keeton 



Tavloi 



||p02) and iBacon fc Tavlod l|20n3li arolied the 
(|2001i|) tomographic inversion method to analytical mod- 
els of clusters put in 'by hand', with mixed success. How- 
ever, the reliability of these tomographic reconstruction 
techniques have yet to be characterized on realistic dis- 
tributions of matter from N-body simulations; although 
it is clear that the fidelity of the reconstructions will be 
severely degraded by line of sight projections of large 
scale structure. 

In this work we introduce a complementary to- 
mographic technique which is effectively two dimen- 
sional. It is qualitatively similar to that used b 
iWittman etlD (gOOl, 200l) and iTavlor et all pOa 
and also to maximum likelihood methods developed 
to study clu ster mass pr ofiles f Geiger fc Schneider 
R?98; Schneide r. King, fc Erb en 2000 HKing fc Schneided 
I2OOI.') . The efiicacy and reliability of the method is 
investigated by applying it to our large ensemble of N- 
body simulations. Additionally, a Tomographic Matched 
Filtering (TMF) scheme, which combines tomography 
and matched filtering, is introduced which takes ad- 
vantage of the additional radial information provided 
by photometric redshifts of source galaxies. The TMF 
is similar in spirit to the matched filtering algorithms 
use d to find clusters in opt i cal surveys (J ostma n et al . 
Il^96; Ke nner et al.1 11991 iWhite fc Kochanekl ^nflf 
iKochanek et al.l l2003j) . Applying it to our ensemble 
of simulations, we find that it is superior to fil- 
tering techniques used previously (Reblinskv et alj 

1999 fWhitc. van Waerbeke, fc Mack^l 

200a |Padmanabhan. Scliak. fc PenI 12003 

Hamana. Takada. fc Yoshida. .2003,) . detecting more 
clusters per square degree and probing lower masses and 
higher redshifts. 

The outline of this paper is as follows. In §2 we de- 
scribe our implementation of a PM code to simulate weak 
lensing observations. The formalism behind our maxi- 
mum likelihood tomographic technique and the adaptive 
matched filtering method is introduced in §3. In §4 we 
apply the TMF and other filters to a large ensemble of 
simulations and determine the 'optimal' filter for clus- 
ter detection. The completeness of this optimal filter is 
discussed in §5. We assess the reliability of tomographic 
redshifts in §6 and conclude in §7. 

2. SIMULATING WEAK LENSING 

2.1. The PM Code 

To evolve the dark matter distribution into the nonlin- 
ear regime, we use a particle-mesh (PM) N-body code. 
This code, wr itten by Changbom Pa rk, is d e scribe d 
and tested in IPaTkj i|r990() and iPark fc GottI l(T99l|) . 
and has been used in studies o f pecu liar velocities 
ijBerlind. Naravanan. fc Weinberg I 120001) and biased 
galax y formation i^NaravanairBerii nd. fc Weinberg! 
120001: Bcrhnd. Naravanan, fc Weinberel |200H) 

The simulation uses a stagge red m esh to 

compute forces on particles lIMelott I |^8^ 
Centrella. Gallagher. Melott. fc Bushousd 11981 IParlj 



The force on each particle is calculated by finite differ- 
encing the potential field, which is calculated from the 
density field in Fourier space using the kernel — 1/fc^ and 
fast Fourier transforms (FFT) . The gridded density field 
is computed from the particles using the cloud- in-cell 
(CIC ) charge assignment scheme ((Hockney fc Eastwood I 
11981(1 . The simulations are started at 1 -|- z = 50 and are 
evolved using equal steps in the expansion fact or a with a 
sympl ectic leapfrog integrator described in Q uinn et al. I 
|1997). The time step is taken to be less than the 
Courant condition. 

Before we proceed we should justify our use of the PM 
algorithm for the N-body simulations. It is well known 
that the PM algorithm is 'memory limited' in the sense 
that higher spatial resolution comes at the cost of stor- 
ing an exceedingly large force mesh in random access 
memory. The main drawback of PM simulations is thus 
limited dynamic range, whereas the primary advantage 
of the algorithm is speed. We compensate for the lack 
of dynamic range by adopting the "tiling" algorithm in- 
troduced bv .Whi te fc Hu (2000), whereby the light cone 
is tiled with a telescoping sequence of N-body simula- 
tion cubes of increasing resolution. The speed of the PM 
algorithm is essential for characterizing the statistics of 
shear selected clusters for the following reasons. 

First, massive clusters of galaxies are rare events. 
In order to quantify their statistical properties it is 
essential to properly sample the primordial Gaus- 
sian distribution of random phases of the large scale 
structure along the light cone. Similar numerical 
studies using N-bod y simulations (.Reblinskv et all 

1999t iwhite. v an Waerbeke. fc Mackeyl 

200a IPadmanabhan. Seliak^ fc PenI 12003 

Hamana. Takada.^^^shidal I2003|) recycle the out- 



1990|) . and those employed here use 256^ particles and 
a 512"^ force mesh. 

The initial conditions are generated by displacing par- 
ticles from a r egular grid using the Z'eldovich approxima - 
tion (see e.g., lEfstathiou. Davis. White, fc Frenkl[T985|) . 



put of one simulation by reprojecting across the same 
simulation cube numerous times, hence avoiding the 
computational challenge of simulating the entire Gpc^ 
volume. However, the frequency of one very rare event 
could be grossly misrepresented by such a procedure, 
and there is a danger that the variance and tails of 
the mass and redshift distributions of clusters will be 
incorrect. This misrepresentation is exacerbated by the 
fact that the primary leverage of cluster counts as a 
cosmological probe often comes from precisely those 
objects on the exponential tail of the mass function. To 
avoid this problem we tile nearly the entire light cone 
volume with unique simulations, clearly requiring a fast 
algorithm. 

Second, a primary goal of this simulation program 
will be to explore how the distribution of shear se- 
lected clusters depen ds on cosmological parameters 
l|Hennawi fc Snergell i200l . which requires simulating 
many cosmological models that span a large region of 
parameter space. 

Finally, high resolution simulations are not re- 
quired to accurately represent weak lensing because of 
the small scale sh ot noise limit of the observations 
l)White fc Hi J l200dl) . This can be un derstood heuris- 
ticall y from simple scaling arguments (iMiralda-Escudd 
Il991t iBlandford. Saust. Brainerd. fc VillumsenI Il991tl . 
For a galaxy cluster with a singular isothermal profile, 
the effective lensing signal, jVN, scales logarithmically 
with 9, the size of the smoothing aperture. This loga- 
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rithmic scaling ensures that the dominant contribution 
to cluster weak lensing comes from scales of order the 
smoothing aperture, rather than much smaller scales 
which are not resolved. Since the noise in the obser- 
vations we simulate will require averaging over apertures 
> 1', resolving small scale power in the simulations is not 
essential. 

All simulations described in this paper are carried out 
using the currently favored cold dark matter model with 
a cosmological constant (ACDM). The parameters we 
simulated are a total (dark matter + baryons) matter 
density parameter fim = 0.295, a density of baryons 
rib = 0.045, a cosmological constant J^a = 0.705, and 
a dimensionless Hubble constant h = 0.69, and normal- 
ization parameter erg = 0.84, which are very close to 
the WMAP best fit model of .Spergel et a l. (2003). The 
power spectrum is given by a scale invariant spectrum 
of adiabatic perturbations (n = 1), an d for the transfer 
function we use the fitting function of lEisenstein fc Hul 

2.2. Tiling the Line of Sight 

In what follows we describe our implementation of the 
IB^hitc & Hu (2000) algorithm to tile the hne of sight 
with PM simulation cubes. (For a comprehensive dis- 
cussion of numerical issues in weak lensing simulations , 
see e.g..l.TaiTi. Seba.k. Whitfil200?ilVale Whitfll200a 
IWhite Valell2nmF. 

The distortion of a source galaxy at redshift z in the 
direction n on the sky is determined by the shear field 
7(n, z) of the matter between the observer and redshift 
z. In the weak lensing approximation, the shear is com- 
pletely specified by the convergence between the observer 
and comoving distance = D (z), given by 



K(n, Z) = ^ 



-Dz 



D \ S(Dh,D) , , 



and 



dD 

dz 



H{zy 



(2) 



whic h is valid for small angles in the Limber approxima - 
tion il.Tain. Seliak. fc White! 1200(1 IVale fc Whit3l2?ffll . 
Here S is the density contrast field, a = 1/(1 -I- z) is the 
scale factor normalized to unity today, and H{z) is the 
redshift dependent Hubble constant. The shear can then 
be obtained via the Fourier relations 
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where k is the two-dimensional Fourier Transform (FT) 
of the convergence field, and 1 = (^1,^2) is the Fourier 
variable con jugate to pos i tion o n the sky. 

Following Iwhite fc Hul l)2000|) . we tile the line of sight 
with a sequence of telescoping simulation cubes of suc- 
cessively higher resolution. The integral in eqn. is 
directly integrated across the tiles to the observer at 
z = using the overdensity field 5 from the simulations. 
Specifically, for each tile along the light cone, the matter 
distribution is evolved from 1 -I- z = 50 to the redshift 
Zican corresponding to the next segment of the integral 



in eqn. |^ and the rear of the current simulation cube. 
In addition, we begin calculating a new convergence in- 
tegral, K(n, Zroar) from Zrear to the obscrvcr at each tile, 
so that the final output of our simulation is a sequence 
of convergence planes densely spaced in redshift. 

By evaluating the full convergence integral from the 
tiling sequence, the geometry of the rays and the evo- 
lution of the potential are accurately represented. The 
lines of sight originate on a square grid at Zrcar and con- 
verge on an observer at z = 0. The planar lattice has the 
same dimensionality as the force grid iVgrid = 512. For 
staggered mesh PM algorithms, the effective resolution 
of the simulation is set by the particle grid iVpart = 256; 
however, as we will be interested in the densest regions 
occupied by clusters of galaxies, there will be no danger 
of undersampling the particle distribution and the use 
of the finer grid is justified. We work in the small an- 
gle approximation, so that the sky can be treated as flat 
and hence all photons propagate along the z-axis of our 
simulation cube for all lines of sight. The convergence of 
the geodesies as the light cone narrows is implemented 
by multiplying by the appropriate geometric factor for 
each grid point, and by the tiling. Also as mentioned 
previously, the use of a unique simulation for each tile 
ensures the statistical independence of the fluctuations 
and accurately represents the statistics of rare events. 

The tiling sequence is uniquely specified by an open- 
ing angle and a maximum and minimum redshift. The 
redshift z^ax = 4 is the highest source redshift for which 
we evaluate the convergence. The source redshift distri- 
butions we consider peak at z ~ 1.0, thus there will be 
a negligible number of source galaxies at redshift higher 
than Zniax- Below Zmin = 0.2, we tile the remainder of 
the light cone with the same simulation, recycling the 
output of the last cube. This is necessary because we 
cannot shrink the box size beyond the nonlinear scale, 
otherwise the PM code will not evolve the density cor- 
rectly because of the absence of mode coupling to waves 
larger than the box. Furthermore, below Zmin — 0.2, 
it becomes computationally impractical to continue to 
shrink the box and tile the light cone with successively 
smaller unique simulations. The time step for the last 
leg of the simulation with z < Zmin is adjusted so as to 
resolve the next segment of the convergence integral in 
eqn. (P). 

To maintain statistical independence of the fiuctua- 
tions during the short sequence where we recycle out- 
put, we project across a different axis of the simulation 
each time and the convergence grid is displaced by a ran- 
dom offset relative to the density grid. Rays that leave 
the box are remapped by periodicity. The sizes of the 
simulation cubes in our tiling sequence are chosen to ex- 
actly reproduce the comoving volume of the light cone 
for Zmin = 0.2 < z < Zmax — 4, ensuring that the num- 
ber of cluster size halos in the field of view is accurately 
represented. Accurately reproducing the volume of the 
light cone is not essential for studying two point statis- 
tics, but is crucial for predicting cluster statistics and 
may become increasingly relevant for higher order statis- 
tics which depend more sensitively on rare events. Below 
•^min = 0.2, we misrepresent the volume and geometry of 
the light cone. The extra volume implies a slight excess 
of clusters with Zmin < 0.2 and the slab geometry for this 
last stretch results in a deficit of small scale power in the 
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convergence field, since the slabs probe larger scales than 
the converging light cone. Both of these discrepancies 
have a negligible effect on the tomography and filtering 
which we study in this paper. 

Our simulation fields are 4° on a side and the angu- 
lar pixel size of the convergence grids is 4°/512 = 0.47'. 
The simulations use 46 tiles of which 32 are unique PM 
simulations (the last 14 tiles are recycled from the same 
simulation). Relevant parameters for the simulations in 
our tiling scheme are listed in Tabled The right panels 
of Figure n show the dimensionless 3-d mass power spec- 
trum, A^j^gg(fc, Zrcar), for four tiles, obtained by averag- 
ing individual power spectra from 38 independent simu- 
lations. They are compared to analytical fitting formulae 
for the nonlinear power spectrum using the prescription 
of Smith et aLC2QD.3) - 

From Tabled is is apparent that we are forced to sim- 
ulate small simulation cubes at low redshift, which re- 
flects the need to achieve subarcminute angular resolu- 
tion while reproducing the volume of the light cone above 
2min = 0.2. However, there is a danger that small cubes 
misrepresent the evolution of structure if the fundamen- 
tal mode of the simulation goes non-linear, because of 
mode coupling to smaller scales that occurs during non- 
linear evolution. However, note that the nonlinear scale 
Tni = 27r/A;„i at z = defined by 

/'"'ALssW'^lnfc^l, (4) 
Jo 

is Till = 20.3 Mpc for the cosmological model used in 
this paper, where A^^^^g is the dimensionless linear power 
spectrum. Accordingly, because even our smallest cubes 
Lhox = 40.0 for z < Ziiiiii are larger than the nonlinear 
scale, finite box effects should not be an issue. 

2.3. Group Catalogs 

For each tile used to evaluate the convergence, the par- 
ticle distributions are dumped and a halo catalog is pro- 
duced by running a "friends-of-friends" (FQF) group 
finder^ (see e.g., iDavis. Efstathiou. Frenk. fc Whitd 
119851) with the canonical linking length (in units of the 
mean interparticle separation) b = 0.2. The EOF al- 
gorithm groups the particles into equivalence classes by 
linking together all pairs separated by less than b. We 
impos e a minimum halo ma ss of 10^^'^ Mq, where we 
follow Ijenkins et al.l l)200lD and define the mass of a halo 
as the mass of all the particles in the EOF group (with 
b = 0.2). Note that this minimum mass corresponds to 
different numbers of particles for different tiles, as our 
larger tiles have poorer mass resolution. A cluster is de- 
fined as a halo with M > 10^^-^ h^^ Mq. It is clear from 
Tabled that ahhough our mass resolution varies from 
Mpart --3xlO^-2xlO"/i-iMo from the smallest 
tile to the largest, it is always sufficient to resolve cluster 
size halos. As we will see in §3, weak lensing selected 
clusters span the redshift range 0.2 < z < 1.0, so that 
we also spatially resolve the virial radii of cluster halos 
in this range. 

Although there exist several methods to define the cen- 
ter of a FOE group, such as the location of the den- 
sity (potential) maximum (minimum), these require an 

We used the University of Washington NASA 
HPCC ESS group's p ubUcly available FOF code at 
http : //www-hpcc . astro . Washington . edu 
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Tiling Solution 
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NOTES. — Parameters for the 46 simulation cubes in our tiling 
solution for the ACDM model. The sequence comprises output 
from 32 unique PM simulations (the last 14 tiles recycle the same 
simulation). The column column Zicar corresponds to the red- 
shift at which the particle distributions of each tile are output. Is 
is also the source redshift the densely spaced convergence planes 
K(n, 2roar). The size of the simulation cube used for that tile (in 
comoving Mpc), the grid spacing (in comoving h^^ kpc), and 
the particle mass (in Mq) are also given. 

~ A^group calculation for each group, which is not com- 
putationally feasible for a large ensemble of simulations. 
Instead we opt for a simpler, faster definition of the group 
center which scales as A^group- A sphere of radius rgphcrc is 
placed at the location of a random particle in the group, 
and the center of mass of the particles interior to the 
sphere is calculated. Moving the sphere center to this 
new location, the procedure is iterated until the sphere 
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center converges to the center of mass of the particles in- 
side of it. We set rgphoro = 0.4 h^^ Mpc of order the size 
of a cluster virial radius, which gives halo centers that 
match the peaks in our lensing maps accurately. This 
moving sphere method is more robust than using the 
center of mass of all the particles in the EOF group be- 
cause the latter quantity often does not coincide with the 
density maximum for disturbed or elongated structures. 

The left panel of Eigure ^ compares the differential 
mass function for several tiles, again averaged over 38 
simulati ons, with the univers al mass function fitting for- 
mula of I.Tenkins et al.l l|200lD . The close agreement be- 
tween the mass function in the tiles and the fits indi- 
cate that our simulations robustly reproduce the mass 
function down to 10^'^ M©. Eigure |3 shows the cu- 
mulative number of halos in the light cone, dN{> M — 
10^^ h^^ MQ)/dz as a function of redshift, averaged over 
the 38 16 deg^ fields, again compared to fitting formula. 
Because we have been careful to accurately reproduce 
the volume of the light cone, the total number of clusters 
as a function of redshift is reproduced for 0.2 < z < 2.5. 
Below Zmin = 0.2, we overproduce clusters because of 
the excess volume of our simulations where our tiling be- 
comes inefficient. Above z ~ 2.5, there is a tendency to 
underproduce collapsed objects because the resolution 
of our larger tiles approaches the size of the virial ra- 
dius Tvir — 1.0 Mpc of clusters. As the efficiency for 
lensing is only appreciable for 0.2 < z < 1.0, the overpro- 
duction of clusters at z < 0.2 and under production for 
z > 2.5 will not effect our conclusions on clusters from 
weak lensing. 

2.4. Mock Observations 

The observables in weak lensing observations are the el- 
lipticities of background source galaxies which have been 
distorted by the foreground matter distribution. Even 
for the ideal case of no instrumental noise, observations 
are limited by the intrinsic ellipticities of the sources and 
the finite number of Poisson distributed sources on the 
sky. Here we simulate this ideal case by drawing a num- 
ber density n of source galaxies from the source redshift 
distribution 

= -Lz^e-^/^o^ (5) 

ZZq 

and placing them at random positions on our 4° x 4° sim- 
ulated field. This redshift distribution peaks at 2zo, has 
mean redshift (z) = 3zn, and has been used in previous 
studies of cosmic shear l)Wittman et al. ir2000|) . A frac- 
tion fz of source galaxies will have photometric redshifts, 
where the errors are drawn from a Gaussian distribution 
with dispersion az = Sz{l + z) and added to the source 
redshifts. We take n = 40 arcmin"^, zq = 0.50 corre- 
sponding to a peak redshift of z = 1.0, and 6z ~ 0.12. 
Henceforth in this work we set fz = 1 and assume all 
source galaxies have photometric redshifts in order to 
determine the efficacy of our tomographic and adaptive 
matched filtering techniques for the best case. 

The intrinsic ellipticities, tint, of the source galaxies are 
drawn from a Gaussian distribution with a single com- 
ponent rms dispersion 7rms = 0.165 (Bernstein & Jarvis 
1200 2?). Note that there has been some confusion in the 
literature over the intrinsic ellipticity, and larger values 
have been employed in other studies. The value used 



100 




M [h"' M,„] k (h/Mpc) 

Fig. 1. — Mass functions and 3-d power spectra for four 
tiles. From top to bottom the redshift of the tiles 
1.52,1.01,0.60,0.20 (see Table^lfor simulation parameters). The 
left panels show the mass function (histogram) averaged over 38 
different simul ations of the same t ile compared to the fitting for- 
mula (solid) of ljenkins et alj|20011) . The right panels show the 3-d 
power spectra of the density field for each tile. Solid lines are the 
power spectra averaged over 38 simulations. Dashed curves show 
the nonli near power spectra from the analytical fitting formulae of 
ISmith eT al.l I200S) and dotted curves are the linear theory power 
spectra. 



here is achievable if the o ptimal weighting discussed in 
ijBernstein fc JarvT^ 12002(1 is applied to the sources for 
the asymptotic case of no instrumental noise (Gary Bern- 
stein, private communication). 

The simulations output convergence planes ^(n, Zrcar) 
for < Zrcar £ 4 densely spaced in redshift as listed 
in Tabled and the shear fields 7(n, Zj-car) are obtained 
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1 2 3 

z (redshift) 



Fig. 2. — Total number of clusters above M = lO^^-^ Mq 
in a 16 deg~^ patch of sky as a function of redshift. The histogram 
shows the redshift distribution averaged over 38 simulations. The 
solid li ne is the redshift dist ribution obtained from the fitting for- 
mula of lJenkins eTail 12001]) . 



from the convergence via FFT using eqns. l|2Jl. A source 
galaxy at redshift z is sheared by the nearest shear plane 
with Zicar < 2, via the weak lensing relation 



Cint + 7 1 



(6) 



which is valid for k <C 1, I7I <C 1. The shear is inter- 
polated from the grid onto the source galaxy position 
using bicubic spline interpolation. The end result is a 
mock catalog of source galaxy ellipticities with photo- 
metric redshifts, which we use to explore the statistics of 
clusters below. 

In what follows we will want to compare mock obser- 
vations with the noise properties described above with 
noiseless weak lensing observations in order to assess 
the intrinsic limitations of tomography and weak lensing 
searches for clusters. We define noiseless observations to 
be an infinite number of source galaxies with zero in- 
trinsic ellipticity and photo-z error (i.e. 7rms = 0.0 and 
5z = 0) drawn from the source redshift distribution in 
eqn. ((SJ, and placed on the simulation grid rather than 
at random positions. Placing the source galaxies on the 
grid removes the Poisson clustering of the source galax- 
ies, so that the angular resolution is then limited by the 
resolution of the simulation rather than shot noise. Be- 
cause we cannot simulate an infinite number of sources, 
we draw 50 galaxies from eqn. Q for each of the 512^ 
grid points, which we find is sufficient to converge to the 
limiting case of no noise. The left panel of Figure|Slshows 
a Kaiser-Squires reconstruction (Kaiser & Squires 199|) 
of the mean convergence field for noiseless data. 

2.5. Matching Peaks with Clusters 

Given a mock catalog of source galaxy ellipticities, we 
construct smooth maps by convolving the data with some 
filters. Candidate mass selected clusters will correspond 
to the peaks in these smoothed maps. We elaborate on 
the filtering techniques in the next section, but focus 
here on the details of matching the peaks in smoothed 
maps with clusters from the simulation tiles. We locate 
peaks in the maps using a simple algorithm whereby a 



peak is identified if it is higher than all of its neighbor- 
ing pixels. While more complex algorithms exist to find 
peaks in pixelised data, because our maps have already 
been smoothed, searching for local maxima is sufficient. 
Given a list of peaks from the smoothed map and the 
cluster catalog constructed by applying EOF to the sim- 
ulation tiles, we can correlate the peaks with the cluster 
catalog and locate all halos along the line of sight of a 
given peak. A peak in a map is considered a cluster de- 
tection if there is a cluster in the light cone within an 
aperture 6'niatch = 3' centered on the peak. Although it 
might seem more appropriate to set this aperture size 
to the field of view of some 'follow up' instrument, 3' is 
well matched to the angle subtended by the virial radius 
of a typical cluster at z = 0.4, a typical redshift for a 
shear selected cluster. Further our interest here is in the 
primary object responsible for the lensing signal and not 
objects at larger angular separation that happen to be in 
the vicinity of the peak. In the event that there are mul- 
tiple clusters within the aperture centered on the peak, 
we take the most massive cluster to be the 'match' to 
that peak. Depending on the method of follow up, the 
object most likely to be detected will be some function of 
mass and redshift, though we neglect this subtlety here. 
Finally, to avoid the problem of multiple peaks in the 
map corresponding to the same cluster, we keep only the 
highest peak in an aperture ^iso = 1' centered on each 
peak, so that less significant peaks near higher peaks are 
discarded. 

3. TOMOGRAPHY AND MATCHED FILTERING 



Previous s tudies (Reblinskv et aL 

19991 iWhitg, y^n~aerbcke,_ fc Mackeyl 

" 2002t iPadma nabhan! Sciiak! fc Penl l200a 



Hamana. Takada. fc Yoshidai i2003^) . have neglected 



the extra information provided by photometric redshifts 
of source galaxies when searching for clusters in weak 
lensing data. Without redshift information, the source 
galaxy ellipticities provide a noisy measure of the mean 
shear 

(7) 



7(n) = J dzpz{z)j{n,z), 



which is the shear out to a given redshift averaged over 
the source redshift distribution Pz{z). Knowledge of the 
source redshift enables one to measure the shear more 
accurately, and in this section we present a tomographic 
matched filtering scheme which fully incorporates this 
extra information. 

The tomographic matched filtering (TME) technique 
is similar in spirit to matched filtering algorithms 
used to find clusters in opti cal surveys j^ ostman ct ^ 
p96; KcDuer ct aljJiMi iWhite & Kochanekl l200a 
iKochanek et al.. .200^ and is also qualitatively sim- 
ilar to the maximum likelihood techniques devel- 
oped to study cluster mass profiles from weak lens- 
ing (Geiger & Schneider 1998; Schneider, King. & Erbe3 
120001 IKing fc Schneideii |2OO10. In what follows we 
present a well-defined procedure for identifying clusters 
of galaxies in weak lensing surveys that makes optimal 
use of both the shape and photometric redshift informa- 
tion of the background source galaxies. It can be ap- 
plied to weak lensing data for which only the overall 
source redshift distribution is known, for which photo- 
metric redshifts of the source galaxies are available, as 
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well as combinations of the two scenarios (i.e. a fraction 
of galaxies with photometric redshifts and the rest with 
only shapes). Furthermore, we will see in §4 that a few 
source redshift bins perform nearly as well as full pho- 
tometric redshift information, so simple color cuts and 
apparent magnitude priors can be used to extract the 
extra tomographic information. 

The matched filter identifies clusters in weak lensing 
data by finding the peaks in a cluster likelihood map gen- 
erated by convolving the source galaxy ellipticities with a 
filter that models the distortion caused by the foreground 
mass distribution. The peaks in the likelihood map will 
correspond to the locations where the match between the 
cluster model and the data is maximized, hence giving 
the two dimensional location of the cluster. In addition, 
the algorithm produces a tomographic estimate of the 
cluster redshift using the lensin g signal, similar to the 
tomograp hic technique used bv IWittman et al. I l)200lL 
12003:) and lTavlor et"al] lIMil . 



3.1. Formalism 

The goal of the matched filter is to match the data to a 
model that describes the distortion of background galax- 
ies by the foreground mass distribution, here a galaxy 
cluster. For the sake of generality, we parameterize the 
cluster lens by three quantities, a cluster redshift Zd, an 
angular scale Og, and an amplitude A. 

For a cluster at redshift Zd the convergence for a source 
galaxy at position Oj and redshift Zj can be written 



where 



and 



K{9j, Zj) = Z{zj; Zd)Kao{Oj) 



Z{z; Zd) 



-H{z ~ Zd), 



(8) 
(9) 

(10) 



Here i;crit(zd, ^s) 



is the critical density for 



^critV-a, -sy — 4^(3 jj^^^ 

lensing with D^, D^s, Ds the defiector, deflector-source, 
and source distances, respectively. The Heaviside step 
function accounts for the fact that sources in the fore- 
ground of the deflector are not lenscd, and the con- 
verg ence for a hypothetical source at infinite distance (sec 
e.g. iSeitz fc Schneideill997HKing fc Schneide r 2001, for 
similar treatments). The same relationship holds for the 
shear, 7(2;) — Z(z;zd)7oo- For a flat universe, eqn. ^ 
simplifies to 



Z{z; Zd) = max ( 1 — — ^, 



(11) 



In the weak lensing regime k ^ 1, I7I <C 1, the ob- 
served complex ellipticity of a source galaxy is related to 
the shear field by eqn. For a spherically symmet- 
ric mass distribution centered about the position ^o, the 
shear will be entirely tangential 

j{e) = -j^i9;e'o)e^^^ (12) 

where is the azimuthal angle about the position Oq- We 
employ the standard definition of the tangential shear 

7^(^; 00 ) = -[71 cos(20) + 72 sin(20)] 

= -^[j{e + 0'o)e-^'*] (13) 



and analogously for the tangential component of e, and 
hence the tangential ellipticity is related to the tangential 
shear by = ej^^ -I- 7^. For our model cluster lens we 
take the mass distribution to be spherically symmetric, 
and henceforth work primarily with the tangential shear 
and tangential ellipticity. 

For a cluster centered at position with the parameter 
vector {Z(i,9s,A) the convergence can be written 

K,-nodc\{0j,Zj;Zd,9s,A) ^ (14) 

^crit 

= Z{zj; Zd)A K{xj), 

where Xj = \9j — 0o\/9s, K{x) is the cluster convergence 
profile in units of 9s, and the amplitude A is given by 

Soi^d 



^~ c2/47rG' 

where Eg is the surface density normalization. 
The tangential shear for this mass distribution is 

li-nodc\{9j,Zj; Zd, 9s, A) : 
where G is related to K by 



(15) 



Z{z.y,Zd)AG{x,) (16) 



G{x) = - / K{y)ydy-K{x), 



(17) 



from the relation 7 = R — k. We omit the tangential 
superscript on 7modei and emodci for notational simplicity. 

The probability of measuring a source galaxy with tan- 
gential ellipticity at position 9j with source redshift 



given the lens model in eqn. (|16l) is 



2vra| 



: exp 



1 e 



7modcU 



(18) 



The measurement error (assumed to be Gaussian) f7| 
in principle includes contributions from the intrinsic el- 
lipticities of the source galaxies as well as an error term 
due to the photometric redshift error of the source galaxy 
at Zj , but in practice the former will dominate because of 
the the slow variation of the function Z (z; Zd) in eqn. Hll|l 
with source redshift. 

If the only knowledge of the source redshift comes from 
the aggregate source redshift dis tribution, one must work 
with the first moment of Z{z) ijSeitz fc Schneide3ll997t 
Sing & Schneider 2001), {Z) = J dz p^{z)Z{z; Zd) where 
Pz is the source redshift distribution in eqn. In this 
case, {Z) must be substituted for Z in eqn. 116|l . 

The likelihood of finding a cluster at position 9o with 
parameters (zd, 9s, A), is the product over all the data of 
the individual probabilities 

(19) 



C{9^o,Zd,9s,A)^YlP, 



and the log likelihood is 



ln£ 



7model)^ 



2 ^ 

j 



(20) 



Expanding the expression in eqn. (|20|l and dropping 
terms that do not depend on the parameter vector gives 



1 (2e^7modci - 



'^modcl) 



(21) 
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This expression has a linear dependence on A, which can 
be exploited to lower the dimensionality of the fit. Set- 
ting the derivative of eqn. (|21|) with respect to A to zero 
gives 



1 -^ZG 



Substituting this back into eqn. (|21|l finally gives 



(22) 



(23) 



Thus, the log likelihood is a convolution of the data 
with a 'matched filter', with each source galaxy inverse 
weighted by its respective error. The technique is 'adap- 
tive' in that it searches for parameters {zi^9s,A) that 
maximizes the contrast between the cluster and the back- 
ground noise, making full use of the additional informa- 
tion provided by photometric redshifts of source galax- 
ies, which is reflected in the redshift 'weights' used in 
eqn. (fl^ . 

Ideally, one would perform the convolution in eqn. H23|l 
with the fa st algorithms introduced and applied to weak 
lensing by iPadmanabhan. Seliak. fc PenI l)2003|) . which 
do not require spatial binning. Here for simplicity, we 
opt to use FFT methods. Doing the convolutions with 
FFT's requires binning the source galaxies both spatially 
and in redshift. In the event that the scale of the matched 
filter, 9s, is kept fixed when maximizing the likelihood, 
then the convolution in eqn. (|23|l can be done with a 
single FFT for each redshift bin, as w e see below. 

For the spatial binning wc follow iSeitz fc Schneider! 
l(l996^) and write the complex ellipticity at any gridpoint 
k as 



with 



Wj — exp — - 



2A02 



(24) 



(25) 



where is a smoothing length. 

In choosing the source galaxy redshift bins, we take 
Pz{z) into account and require each bin to have an equal 
fraction of the probability. Because the function Z [z] Zd) 
in eqn. varies slowly with redshift for z > Zd, this 
binning can be relatively coarse. We consider two dif- 
ferent binnings: a coarse binnning with only 3 bins, as 
might be achieved by using as little as two colors and an 
apparent magnitude prior, and a finer set of 5 bins which 
would require multi color data. We don't simulate the 
effects of errors in assigning galaxies to their respective 
bins. The spacing of the bins is chosen to contain equal 
probability as given by the source redshift distribution in 
eqn. (jSJ. The redshift ranges of the bins and the central 
redshift are listed in Tabled We will henceforth refer to 
the TMF with three coarse bins as the TMF3 and that 
with five finer bins as the TMF5. 

Combining eqns. (|22|l and (|24|l with the log likelihood 
in eqn. gives 

1 (Y.T^' Z{z,,Zd)YZ' elG{x,))' 
\iiC{9a,Zd,9s''- ^ ' 



2^2 z\z,\^d)Y.7 G-^ixk) 



TABLE 2 
Source Redshift Binning 



Type 


^low 




^center 







- 1.02 


0.70 


Coarse 


1.02 - 


- 1.72 


1.34 




1.72 - 


oo 


2.28 







- 0.77 


0.55 




0.77 - 


- 1.14 


0.96 


Fine 


1.14 - 


- 1.55 


1.34 




1.55 - 


- 2.14 


1.81 




2.14 - 


oo 


2.66 



NOTES. — Source redshift binnings used in this paper. The 
"Coarse" binning uses three redshift bins, while the "Fine" bin- 
ning uses five. Bins are chosen to contain equal probability from 
the source redshift distribution in eqn. |F] Lower, upper, and cen- 
tral redshift are denoted by ^lowi ^high, and ^center, respectively. 



Here e^. is the tangential ellipticity about 6*0 at grid- 
point k for source galaxies in the ith redshift bin. The 
number of grid points and source redshift bins are de- 
noted by Ug and Uz respectively. The sum over i ex- 
tends to rix -I- 1 to indicate that the last bin will be 
those galaxies that do not have photometric redshifts, 
for which the mean value of {Z){zd) must be used. For 
simplicity we have taken all the ellipticity errors to be the 
same, a = a^. This can be easily generalized to the gen- 
eral case of different errors for each source galaxy, which 
would amount to modifying eqn. I|24(l to take the differ- 
ent errors into account. Note that the sum over k in the 
numerator of this last expression for the likelihood is just 
a convolution of the tangential shear with the 'matched 
filter' G{xk) which can be done using an FFT. 

If we define 



Mi{9o) = 



r^- E?^r.G(x.) 



1/2 ' 



[E7 G^ix,)] 

then the likelihood can finally be written 



h-iC{9o,Zd,9s) = 



1 [Ei Z{zf,zM{9o] 

2^ ^r^^zHl^ 



(27) 



(28) 



where Miifii^) is the convolution of the ellipticity of 
the source galaxies in the ith redshift bin with the 
normahzed kernel G{xk) /[^f^G'^{xk)Y^'^ ■ If there is no 
photometric redshift information, Uz = 0, the redshift 
weight factors of (Z)'^(zd) in the numerator and denom- 
inator of eqn. (|28() will cancel, and the log likelihood 
just reduces to the square of a single map M{9o), 
which is a convolution of the tangential shear with the 
normalized kernel. It is only this limiting case which 
has previously been considered by stu dies that neglect 
photo metric redshift information l |Reblinsk^^et_al 



1^ 



2002 



J^hite. van Waerb eke. fc Macke- 



IPadmanabh an. Seliak, & Pe: 



Haman a. Taka da. k Yoshidall2003|) . 



(26) 



The expression for the likelihood in eqn. H28|l can be 
maximized at each point on the sky, allowing one to cre- 
ate a cluster likelihood map. Clusters will be located at 
the peaks in this map and the maximum likelihood Zd 
will be the tomographic estimate for the lens redshift. 

An example of a likelihood map constructed from 
binned source galaxies is depicted in Figure |3| The 
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left panel shows a Kaiser-Squires reconstruction of the 
mean convergence field (recall eqn. d) for the case of 
no noise. The right panel shows the cluster likelihood 
map constructed by applying the TMF to mock data 
which includes noise. Three coarse source redshift bins 
have been used to make this map (TMF3). For the 
function G{x) we have used the 'optimal filter' deter- 
mined in §4 below (eqn. with a scale angle, 0s = 
0.50', and gaussian truncation radius, ^out = 5.5'), 
which maximizes the number of clusters detected. This 
filter approximates a projecte d Navarro-Frenk- White 
iNavarro. Frenk. fc Whit^ lfl997h (NFW) profile, but is 
truncated with a Gaussian to prevent the likelihood sum 
from being diluted by distant uncorrelated source galax- 
ies. Peaks in the likelihood map are candidate shear 
selected clusters and four of the more significant peaks 
in this map are circled and labeled a-d. 

The probability distributions of the tomographic red- 
shift for each of these four peaks is shown in Figure^ To 
create the likelihood map in Figure O the source galax- 
ies were binned spatially and in redshift as in eqn. (|26|) : 
however, to compute the probability distributions shown 
in Figures 01 the convolution in eqn. H23|l is performed 
exactly in real space with no binning. The mass and 
redshift of the cluster (s) responsible for the peaks are la- 
beled, and the vertical dashed lines indicate the true lens 
redshift (s). 

Although we will consider the accuracy of the tomo- 
graphic redshifts in detail in §5, the examples in Figure 
01 illustrate some qualitative features of the tomographic 
technique which we elaborate on them briefly. 

Peak a corresponds to a fairly massive cluster at z = 
0.29 and the tomography performs well, giving a tomo- 
graphic redshift of z = 0.26. For the higher redshift 
cluster coincident with peak b, at 2; = 0.70, the tomog- 
raphy gives Zd = 0.95, which is incorrect, but the tomog- 
raphy succeeds to indicate that the deflector is at high 
redshift, which is still helpful as the source galaxies can 
be weighted for a high redshift deflector, increasing the 
contrast between the cluster and the noise. 

Projections of several halos at different redshifts are 
quite common, which is illustrated by peak c which is 
a projection of a 2 x 10^"* Mq cluster at 2 = 0.27 
and two large group size halos of 5 x lO^-^ h^^ Mq at 
z = 0.53 and z = 1.06, respectively. Notice in Figure 
^ that the tomography gives a deflector probability dis- 
tribution with two likelihood maxima at redshifts corre- 
sponding to the two redshifts of the lower redshift halos. 
The fact that the peaks in the probability line up with 
the deflector redshifts is somewhat of a coincidence, since 
we are fitting a model to the shear with only a single de- 
flector. 

Finally, as we will elaborate on further in the next sec- 
tion, mass selected cluster samples are plagued by pro- 
jection effects due to large scale structure and small halos 
along the line of sight. Although the peak labeled d is 
the highest peak in the likelihood map, there is no cor- 
responding halo in the light cone above lO^'^'^ Mq 
within an aperture of radius 3' (which is our peak-halo 
matching criteria). However, there is no indication that 
this peak is a projection from the tomography, which 
gives a probability distribution similar to the others in- 
dicating the tomographic redshift is Zd = 0.36. It is 
perhaps not surprising that Zd ~ 0.4, when one consid- 



ers that the lensing efficiency peaks at this redshift, and 
large scale structure along the line of sight weighted by 
this lensing kernel is likely to yield a tomographic redshift 
at its peak. 

Although the tomographic redshift probability distri- 
bution radially resolved two projected clusters for peak 
c, it led us to believe that peak d was a shear selected 
cluster just like any other, at Zd = 0.36, when in reality 
it was a projection of large scale structure. A detailed 
investigation of the degree to which cluster tomography 
can distinguish projections from real clusters is beyond 
the scope of this work. If it were possible to make a such 
a distinction based on the tomography, one could imagine 
flagging certain these peaks, so that a shear selected clus- 
ter sample could be cleaned of such projections, which 
will complicate attempts to determine cosmological pa- 
rameters from the redshift distribution of shear selected 
clusters. Of course, the projection hypothesis can also be 
tested by searching for overdensities of early type galax- 
ies (i.e. cluster galaxies associated with the lenses) as a 
fu nction of pho t ometri c redshift 

lIDahle et all 1200^ ISchirmer et all l2003l I200I 
iTavlor et a,lJl2004D . 

4. THE OPTIMAL FILTER 

4.1. Discussion of Filter Functions 

In this section we will consider several filter functions 
G{x) to convolve the tangential shear data with for the 
TMF introduced in §3. But first, a few general remarks 
about searching for clusters in weak lensing data are in 
order. 

Like the C statistics 

ijFahlman. Kaiser. Squires, fc Woods! I1994D and the 
aperture mass measures M.p ijSchneideil 119961. The 

TMF has the virtue that it operates directly on the 
shear, which is the observable quantity in weak lensing 
observations. Some studies of shear selected clus- 
ter detection search for clusters by first performing 
a Kaiser-Squires density reconstruction, and then 
smoothing the reconstructed convergence field with 
a filter and sea rching for peaks (Reblinskv et alT] 
fl99 9l White, van Waerbckc. & MackgiJ ^ l200a 
iHamana. Takada. fc Yoshida.2003} . However, working 
directly with the shear is crucial, when one recalls 
that the surface mass density at any point depends 
on the galaxy ellipticities at all distances (see e.g., 
iBartelmann fc Schneideil 12001(1 . The kernel that the 
ellipticity data is convolved with in a density recon- 
struction is not local, decaying as 9^^. Because of edge 
effects from the field boundaries and all the masked 
bright stars, diffraction spikes, and bleed trails (see 
Figure 14 of iVan Waerbeke fc Mellie j pOO^ for a nice 
example), the non-local Kaiser-Squires reconstruction 
will propagate ringing from this 'window function' into 
other regions of the mass map, significantly amplifying 
noise. It should thus be avoided for the purposes of 
finding clusters. 

Besides avoiding the deleterious effect of the noisy 
window function or mask, convolving the shear with a 
compact kernel seems sensible because clusters of galax- 
ies have characteristic size on the sky. However, weak 
lensing statistics using small apertures are complicated 
by the non-local relationship between shear and mass. 
Any filter convolved with the shear within some aper- 
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Fig. 3. — Left panel: Kaiser-Squires reconstruction of the mean convergence field, k, for noiseless mock data. Right panel: Cluster 
likelihood map constructed with the TMF applied to noisy data. Source galaxies were binned into three coarse redshift bins (TMF3). 
Cluster statistics and the tomographic redshift probability distributions for the four clusters labeled a-d are shown in Figure Bl 




Fig. 4. — Cluster statistics and the tomographic redshift probability distributions for the four clusters labeled a-d in Figure |3] The 
mass, Mi4 = Mq, and redshift, z, of the cluster(s) responsible for each peak are labeled. Vertical dashed lines indicate the 

true redshifts of the halos along the line of sight. The location of the peak of the probability is the tomographic redshift, zj, which is also 
labelled. 



ture will have fluctuations imprinted on it from scales outside that aperture. The simplest case is the mass 
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sheet degeneracy, whereby the convergence field recon- 
structed from a source galaxies in a finite field can only 
be determined up to an additive constant or sheet of 
mass (Bartclmann & Schneider 2001). While the mass 
sheet can be intuitively identified with fiuctuations in 
the ^ ~ mode, all large scale structure fluctuations on 
scales larger than the aperture 9, £ < 1/9, result in a vari- 
ance of any statistic we apply to the s hear over a finite 
aperture. "Hockstra (2001, 200^ and IDodelsm] l|2003D 
have studied the effect of the variance due to large scale 
structure on measurements of cluster masses and density 
profiles. For the purposes of identifying clusters, vari- 
ance in our aperture statistic will increase the number of 
false detections, lowering the efficiency of cluster finding. 

The aperture m ass measures l|Schneideil 119961: 
ISchneider et al. lfl998<l constitute a class of filters which 
mitigate the aforementioned problem. Convolution of the 
K map with a kernel U can be shown to be equivalent to 
convolving the tangential shear map with a related kernel 
Q, provided that the kernel U is compensated 



dxxU{x) — 0. 



(29) 



In this work we consider the most widely used pair of 
kernels U and Q which are 



U{x) 
Q{x) 



9 



{l-x^) 



(1 ~ X-) 



(30) 



1998^) . its variance can be written 



where d is the size of the aperture and x = 9/d. 

The aperture mass has the appealing property that it 
provides a lower limit on the mass contained within the 
aperture 9, because of its equivalence to a convolution 
of K. Furthermore, it is relatively una ffected by modes 
larger than the filter scale. As shown bv lSchneider et al~l 



(31) 



where F{£d) is a notch filter which peaks at € ~ and 
suppresses fluctuations from scales smaller and larger. 
The fact that the aperture mass is insensitive to the mass 
sheet degeneracy is manifest by the fact that F{(. = Q) = 
0. Because modes larger and smaller than the aperture 
scale (matched to the size of a cluster) don't cause fluc- 
tuations in the aperture mass, we might expect this fllter 
to more efficiently locate clusters in weak lensing data. 
We return to this point below. 

A potential problem with using the aperture mass is 
that because it is compensated (eqn. I29f) it does not pro- 
vide a very good fit to the tangential shear profile of a 
galaxy cluster. Hence, we don't expect it to work as 
well with the tomographic technique introduced in the 
p revious section. 

Padman abhan. Seliak. fc PenI l)2003f) advocate using a 
matched filter with convergence profile 



1 



[l + xf 



(32) 



where x = 9 /9s, for several different values of the scale 
angl e 9^. This profile approximates a n NFW density pro- 
file ijNavarro. Frenk. fc Whit3ll997fl in projection, and 



has been used in studies of optical clusters (White fc 
Kochanek 2002; White et al. 2002). The tangential shear 
corresponding to this convergence profile is 



Gnfw(2;) 



21n(l +x) 



x{l + x) (l + a;)2' 



(33) 



where we have applied eqn. (|17|) . 

The soft core in this profile is a desirable feature, since 
convolving shear data with a divergent profile gives a 
large weight to a single noisy galaxy resulting in an ul- 
traviolet divergence, as is well known in the literature on 
weak lensing mass reconstructions (see Kaiser & Sauirgi 
I1993D . However, these NFW convergence and shear pro- 
files decay asymptotically as a power law ^ 9~^, and are 
not compact. As we will see, this has a deleterious effect 
on the number of clusters detected and on the efficiency 
of the cluster search, because of a dramatic increase in 
confusion from large scale structure. 

The foregoing discussion motivates the truncated pro- 
jected NFW profile 



G{x) — G'NFw(a;)exp 



29i 



(34) 



where we have truncated the filter in eqn. I|33|l by multi- 
plying with a Gaussian of truncation angle (?out- 

Note that the model shear profile in eqn. H34I) now has 
two free scale parameters 9s and ^out, whose values we 
must determine. In principle, we could leave them as pa- 
rameters, and maximize the likelihood in eqn. H23|l with 
respect to redshift and both angular scale factors. Tests 
on the simulations indicate that for fixed ^out , maximiz- 
ing the likelihood with respect to scale angle 9s and Zd 
performs slightly worse than the case where is fixed 
and the likelihood is maximized only with respect to z^. 
Furthermore, varying the scale angle 9s would require 
performing a whole sequence of convolutions with differ- 
ent size filters, which is more expensive computationally. 
For these reasons we choose to fix 9s = 0.50', which is 
roughly matched to both the pixel size in our simulated 
maps and the angle 9s subtended by the scale radius Tg 
of a 10^'' h-^ Mq NFW cluster at 2 ~ 0.4. Varying 
this scale angle from 9s — 0.10 — 0.70 produces neghgible 
changes in our results. 

Maximizing the likelihood with respect to the trunca- 
tion angle 9ont also performs poorly, because of a ten- 
dency to overfit noise and large scale structure on scales 
much larger than the size of a cluster. Also, the like- 
lihood for small and large 6'out will be over a different 
number of source galaxies and hence degrees of freedom. 
Thus their distributions will have different means and 
variances, complicating a comparison of the resulting 
likelihoods. For these reasons, we take the truncation 
radius ^out to be a 'prior' that we vary by hand, until 
the number of clusters detected for a given noise model 
is maximized. 

In addition to the aperture mass (eqn. pHl ). the pro- 
jected NFW profile (eqn. ^^), and the truncated NFW 
(eqn.lSJ), we also try simply convolving the tangential 
shear with a Gaussian filter of size 0out • 

In general, the degree of smoothing required to detect 
clusters from weak lensing data will depend on the level 
of noise for the particular set of observations, as higher 
noise will require more smoothing. Here we focus on the 
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noise model described in §2. Adapting a filtering scheme 
to different levels of noise will amount to varying the 
overall extent of the filter. 

4.2. Comparison of Filters 

Searching for shear selected clusters entails convolving 
the ellipticity data with a filter and searching for peaks in 
a map. In general some signal to noise ratio threshold, 
v, will be chosen such that peaks above this threshold 
are considered candidate clusters and peaks below it are 
discarded. Given a threshold v, a fraction of all peaks 
larger than will be identified as galaxy clusters and the 
remainder will correspond to false detections. 

We define the efficiency as a function of this threshold 

as 

/ N ^clustcrs(,> ^'j /o^^ 

e(i^) = TT-^- (35) 

'Speaks (> 1^) 

It is clear that raising the threshold i', will increase the ef- 
ficiency, but at the expense of fewer detections nciustcrs(> 
v), since there will be fewer total peaks greater than v. 
Rather than fix v at say Sa, where a represents the vari- 
ance of the noise, we consider the number of clusters 
detected as this threshold is varied. There are several 
reasons why one may wish to work at low thresholds and 
hence low efficiency, but we defer a discussion of this 
point to §5. Note that the definition in eqn. 1)35(1 allows 
us to assign an efficiency to any peak v in our maps, and 
we may speak of an efficiency cutoff of say 75%, where 
it is understood that the we keep all clusters above the 
threshold vj^ where eiiyrz) = 75%. 

In Figure|31we plot the number of clusters detected per 
square degree versus efficiency for the filters discussed 
above. All the aforementioned filters have an angular 
scale parameter, and will yield more or less clusters at a 
given efficiency as this scale is varied. For the aperture 
mass this scale is the size of the aperture The only 
scale in the NEW profile in eqn. H32|l is the scale angle 
9s. For the truncated NFW and the Gaussian, the scale 
parameter is the Gaussian cutoff angle 6'out- For each 
filter we plot two sets of curves corresponding to noisy 
and noiseless mock weak lensing data, with the outermost 
thin curves corresponding to the noiseless case. 

As the amount of smoothing is increased there is a 
tradeoff between averaging over more source galaxies, 
and hence increasing the signal to noise on the one hand, 
but increasing the level of confusion from cosmic struc- 
ture and merging distinct clusters together into one an- 
other. For a given amount of noise, the optimal scale 
for a filter corresponds to the curve furthest to the right 
in Figure 13 which maximizes the number of clusters de- 
tected at each efficiency. The dotted (blue) and dashed 
(red) curves correspond to under and over smoothing, re- 
spectively, and the solid (green) curves correspond to the 
'optimal' scale for the noisy data. Table El summarizes 
the information for the filters discussed in this section 
and plotted in Figure |S| The optimal smoothing scale is 
printed in bold. 

The benefit of using photometric redshift information 
is illustrated in Figure [HI where we smoothed with the 
optimal filter scale for each filter as determined from 
Figure El and listed in Tabic but varied the amount 
of photometric redshift information used. The dot- 
ted (blue) curves ignore photo-z information, the solid 



TABLE 3 
Filters used in Figure |21 



Filter 


Profile Eqn 


Scale Angle 


Angles 


NFW 


(23 




0.01,0.1,1.0 


Map 




1? 


2.0,3.0, 5.0 


Gaussian 




Sout 


1.5,2.0, 4.5 


Truncated NFW 




Oout 


2.0, 5.5, 10.0 



NO TES. — Parameters for the four filters functions G{x) (see eqn. 
I16i used in Figure |F] The table lists the defining equation of 
the profile, the angular scale parameter varied to set the level of 
smoothing, and the three angles used in the figure. The three an- 
gles correspond to undersmoothing (dotted curves in Figure [TSl . 
the optimal smoothing, (solid curves), and oversmoothing (dashed 
gurves), respectively. The optimal smoothing is listed in bold and 
all angles arc in arcminutes. 



(green) curves correspond to the TMF with the three 
coarse photometric redshift bins (TMF3), and the dashed 
(red) curves correspond to the TMF with the five finer 
(TMF5) photometric redshift bins (see Tabled. Both 
TMF's detect significantly more clusters for each filter at 
high and low efficiencies. 

We quantify the increase in clusters detected with the 
TMF in Table^I where we list the number of clusters per 
square degree with and without tomography. We com- 
pare all four filters at a high efficiency cutoff of e = 75% 
and a lower cutoff of e = 60%. To get an idea of the signal 
to noise ratios corresponding to the efficiency thresholds, 
we also list the signal to noise ratio at the efficiency cut- 
off, 1/75 and i-'6o, of the filtered maps which did not use 
tomography. The noise in the maps was computed by 
calculating the variance of smoothed maps of unlensed 
source galaxies which had the same positions and intrin- 
sic ellipticities as the lensed sources. An efficiency of 
e > 75% corresponds to peaks with a signal to noise ra- 
tio V > 4.5, whereas e > 60% corresponds to signal to 
noise cutoff of v > 3.5. The TMF increases the num- 
ber v > 4.5 peaks by up to 76%. For lower thresh- 
olds, the increases in the number of clusters detected is 
less substantial (~ 30%), because the tomography is less 
effective at determining the redshift of the cluster for 
lower signal to noise detections. The TMF increases the 
number of clusters detected for all filters; however, the 
gains are more substantial for the truncated NFW profile 
which best approximates the tangential shear profile of 
clusters. 

Besides increasing the total number of clusters de- 
tected, the TMF substantially increases the dynamic 
range of the resulting cluster sample. The redshift and 
mass number count distributions detected by the trun- 
cated NFW profile for an efficiency cutoff of e > 75% are 
shown in Figure [51 The solid histogram is for the TMF3 
whereas the dotted (blue) line does not use photometric 
redshifts. At low redshifts and high mass, the number of 
clusters detected are nearly the same, however the TMF 
detects many more high redshift clusters and it extends 
the mass sensitivity down to the scale of large groups. 

As is evident from Figures El and Table ^ dividing 
source galaxies into three coarse redshift bins is sufficient 
to reap the benefits of tomographic information and in- 
crease the number of clusters detected. Such a coarse 
redshift division can be achieved with as few as two col- 
ors and an apparent magnitude prior, so that the TMF 
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Fig. 5. — Number density of clusters versus efficiency for four filters. Dotted (blue) curves are undersmoothed, dashed (red) curves are 
over smoothed, and solid (black) curves indicate the filter size that performs best. The inner set of curves are noisy mock data and the 
thin outer curves are for noiseless data. A summary of information on the filters and aperture scales used in this figure is given in Table l3l 



can be used with deep imaging data in as few as three 
pass bands. 

A comparison of the four filters we consider is pre- 
sented in Figure [3 The filters shown are the optimal 
aperture mass (t? = 3.0') , Gaussian {Bout = 2.0'), NEW 
(Os = 0.1'), and truncated NEW {Oont = 5.5'). The left 
panel doesn't use tomography and the right panel is for 
the TME3. The thin outermost set of curves are for 
noiseless data. 

Eor noisy data, the truncated NEW is most effective 
at finding clusters for efficiency cuts between 60 — 75%, 
corresponding to detection significance of v = 3.5 — 4.5, 
detecting ~ 20 — 30% more clusters, especially if tomog- 
raphy is used. The non-truncated NEW filter performs 
significantly worse than the others, even at very high 
signal to noise ratios, because it is not compact enough, 
dropping off as 9^^. 



At the highest efficiencies e > 0.8 corresponding to 
peaks with i/go ^ 6, the aperture mass detects more clus- 
ters than the other filters. Eor these highest peaks, the 
aperture mass is less affected by large scale structure 
and noise, and does the best job of sorting out projec- 
tion effects. Whereas the truncated NEW ranks peaks 
by some combination of mass and also goodness of fit to 
the NEW profile, by definition, the aperture mass ranks 
peaks by mass, since it provides a lower limit on the mass 
enclosed within the aperture. Eurther, this result is not 
unexpected in light of our discussion of the variance of 
the aperture mass (eqn. I31|l in M.l\ Notch filtering of the 
shear field results in a smaller variance from both noise 
and large scale structure, which for the most massive 
clusters ^ 10^^ Mq clusters, increases the efficiency 
of the cluster search. 

A conspicuous feature of Eigures [SJ El and |3 is that 
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Fig. 6. — Number density of clusters versus efficiency for four filters using varying degrees of photometric redshift information. Dotted 
(blue) curves use no photometric redshift information, dashed (red) are for sources binned into five fine photometric redshift bins (TMF5), 
and solid (green) curves are for the sources binned into three coarse redshift bins (TMF3). The inner set of curves are noisy mock data and 
the thin outer curves are for noiseless data. The TMF exploits the information provided by source photometric redshifts to increase the 
number of clusters detected at all efficiencies, even for crude redshift bins. A comparison of the performance of these filters is presented in 
Table m 



even for the unachievable case of no intrinsic cUipticity 
or Poisson noise, the maximum intrinsic efficiency of 
weak lensing cluster searches is only ~ 85%. Thus 
~ 15% of even the most significant peaks detected in 
noiseless weak lensing maps do not have a collapsed 
halo with M > 10^^-^ Mq within 3 arcmintues 
(which is our peak halo matching criteria). This 
intrinsic inefficiency is due to projection effects and 
to confusion from cosmic structures, and has been 



noted by previous s tudies llMetzler. White, fc Loken 



2001 



2002: 



IWhite. van Waerbeke. fc Mackey 

iPadma nabhan Sel iaJk. fc Fed 120031 

Hamana. Takada. fc Yoshida 200"^ . This fact sheds light 
on the purported detections of 'dark clumps' reported 
recently by several groups ijFischeil Il999t lErben et al.l 



2000t lUmetsu fc Futamas^ l2000t IMiralles et all l200l 
Dahleer^Ln200^ These objects could be entirely 
consistent with the ~ 15% false detections, though a 
definitive conclusion will have to wait for future wide 
field weak lensing surveys. Because of the intrinsic 
inefhciency that we find here, only statistical statements 
can be made about a population of dark clusters. 

5. COMPLETENESS OF SHEAR SELECTED SAMPLES 

In this section we discuss the completeness of shear 
selected cluster samples detected by the filter which per- 
formed best in the previous section. Specifically, we con- 
sider the truncated NFW filter with 6'out =5.5' without 
photometric redshift information and with source galax- 
ies binned into three coarse redshift bins (TMF3). The 
left panel of Figure |51 shows the completeness as a func- 
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Fig. 7. — Comparison of the four filter functions with and without tomography. The left panel uses no photometric redshift information, 
whereas the right panel uses tomography with source galaxies binned into three coarse redshift bins (TMF3). The optimal filtering scale 
is used for each filter (see Table The inner set of curves are for noisy data and the thin outer curves are noiseless. The dot-dashed 
(magenta) curve is for the NFW profile, the dashed (red) curve is the aperture mass, the dotted (blue) curve is the Gaussian, and the solid 
(green) curve is the truncated NFW. The amount by which tomography enhances the performance of each filter is listed in Table HI 



TABLE 4 
Filter Comparison 



Filter 



"TMF5(> 60%) % increase 



NFW 


4.1 


4.45 


5.34 


6.27 


41 


6.3 


0.28 


0.34 


0.29 


19 


Map 


3.5 


2.72 


3.33 


3.38 


25 


4.5 


0.48 


0.63 


0.51 


33 


Gaussian 


3.4 


4.94 


6.38 


6.54 


32 


4.4 


1.10 


1.47 


1.28 


34 


Truncated NFW 


3.4 


6.20 


7.53 


7.70 


24 


4.9 


0.99 


1.75 


1.45 


76 



U75 n{> 75%) raTMF3(> 75%) nTMF5(> 75%) % increase 



NOTES. — Comparison of filters with and without tomography. The number of clusters detected per square degree without tomography, 
with coarse redshift tomography (TMF3), and fine redshift tomography (TMF5), are denoted by n, n-rMFSi a^nd '^TMFS- Two efficiency 
thresholds are compared, 60% and 75%, and the S/N ratio threshold (computed from the maps without tomography) corresponding to 
each efficiency cut, ueo and u75, are listed. The column labeled "% increase" shows the improvement when tomography is used of either 
the TMF3 or TMF5, whichever is greater. 



tion of redshift for all clusters with M > 10^'* '^ Mq. 
The right panel shows completeness as a function of clus- 
ter mass for clusters in the redshift range 0.2 < z < 0.8. 
Thick lines are for an efficiency cutoff of e > 60% {S/N > 
3.5) whereas thin lines are for e > 75% S/N > 4.5. For 
the lower efficiency cut of e > 60%, the completeness is 
> 50% only for the most massive clusters M > 10^'*-^ 
which are near the peak of the lensing efficiency at 
z ~ 0.3. Limiting the cluster search to the higher ef- 
ficiency threshold e > 75% further reduces the complete- 
ness by ~ 40%. As a function of cluster mass the com- 
pleteness only approaches unity for the most massive 
clusters M 10^^ Mq, where we begin to suffer 
from small number statistics. 

Previous studies of shear selected clusters carried out 
by s everal groups llWhite. van Waerbeke. fc Mackeyl 

200l iPadmanabhan Seliak fc"Penl 12003 

Hamana. Takada. fc Yoshidal 120031) ' come to a simi- 
lar conclusion. Namely, that shear selected cluster 
samples suffer from severe incompleteness except at 



the highest masses. Although our results are broadly 
consistent with these studies, a direct comparison 
is difficult because the noise levels, source redshift 
distributions, aperture to match peaks to clusters, 
and filteri ng techniques employed are all d i fferent . In 
particu lar 'Wh ite, van Waerbeke. fc MackevI l)2002() and 
iHaman a. Taka da. & Yoshidal l|2003D use a single source 
redshift plane which implies the lensing efficiency will 
be a narrower function of redshift than that given 
by the source redshift distributioii s cons idered here 
and in IPadm anabhan. Seliak. fc PenI l|2003(l . A source 
redshift distribution results in lower completeness than a 
single source plane (|Padmanabhan. Seliak, fc Pen"2003'^ 
because the broader lensing kernel increases noise from 
projections of large scale struture. 

6. TOMOGRAPHIC REDSHIFTS 

In this section we characterize the reliability of the to- 
mographic redshifts determined by maximizing the like- 
lihood in eqn. (|23() (see Figure^, and study how reliabil- 
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Fig. 8. — Redshift (left) and mass (right) number count distributions of clusters detected by the truncated NFW with and without 
tomography. Dotted (blue) histograms do not use photometric redshift information while solid (black) histograms are for the source 
galaxies binned into three coarse redshift bins (TMF3). The cluster samples used to construct these histograms used an efficiency cutoff 
of e > 75% corresponding to a S/N > 4.5. Even with just three coarse redshift bins, the TMF increases the number of clusters detected 
by 76% (see Table 1^. Tomography increases the dynamic range of weak lensing searches for clusters, detecting more high redshift clusters 
and extending the mass sensitivity down to the scale of large groups. 
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Fig. 9. — Completeness of shear selected clusters detected with and without tomography. The left panel shows completeness as a function 
of cluster redshift for all clusters in the light cone above 10^^'^ M©. The right panel shows completeness as a function of cluster mass 
for all clusters in the redshift range 0.2 < z < 0.8 for which the efficiency for lensing is appreciable. Dotted (blue) histograms do not use 
photometric redshift information while solid (black) histograms are for the source galaxies binned into three coarse redshift bins (TMF3). 
Thick lines are for an efficiency cutoff of e > 60% [S/N > 3.5) whereas thin lines are for e > 75% [S/N > 4.5). The left panel is cutoff at 
z = 0.2 because below this redshift our simulation scheme does not accurately reproduce the counts of clusters (see Figure 1^ 



ity depends on detection significance, mass, and redshift. determine the effect on the tomographic redshift errors. 
Then we will vary the filter profile G{6) in eqn. H16|l and For the rest of this section we focus on a sample of 
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shear selected clusters obtained with the TMF3 for the 
truncated NEW profile with 0out = 5.5' applied to noisy 
data. We showed in the previous section that this filter- 
ing scheme was superior to the rest for an efficiency cutoff 
of e = 75% corresponding to i/ > 4.5 detections (see Ta- 
ble 0J . The total number of clusters in the simulated 
608 deg^ above this cutoff is 1060, sufficient for a statis- 
tical study. Although in what follows we consider both 
noisy and noiseless data, the cluster sample will always 
remain the top 1060 clusters with e > 75% detected in 
the noisy maps. The mass, redshift, and likelihood distri- 
bution of the cluster sample is illustrated by the scatter 
plots in Figure HTJl The colors and sizes of points reflect 
the likelihood, or detection significance. Note that like- 
lihood of a cluster is a function of both cluster mass and 
redshift. 

In order to construct the likelihood maps for the TME3 
(Figure |3Jl we binned the source galaxies both spatially 
and in redshift as in eqn. H26|l. so that the convolutions 
could be performed quickly with EFT's. However, for the 
purpose of characterizing the reliability of tomographic 
redshifts we evaluate the likelihood exactly carrying out 
the full sums in eqn. (|23|l . 

6.1. Tomographic Redshift Errors 

In Figure [TTl we show scatter plots of tomographic red- 
shift, Zd, versus real redshift where again points are col- 
ored according to their detection significance and higher 
likelihood points are larger. The left panel includes noise 
while the right panel is for noiseless data. The filter pro- 
file G{9) used to determine the tomographic redshift (see 
ean. [TS|) was the truncated NEW profile with Og ~ 0.50' 
and ^out = 5.5', which detected more clusters than the 
others. For noisy weak lensing data, 45% of clusters 
have tomographic redshifts with |Az| < 0.2, where 
Az = Zroai — Zd- The rms deviations from the real red- 
shifts are ctaz = ((Az)^)^/^ = 0.41. For the ideal case of 
no noise these numbers change to 67% within \Az\ < 0.2 
and (TAz = 0.34. The reliability of the tomographic red- 
shifts for the noiseless case refiect the intrinsic limita- 
tions of this effectively two dimensional method. The 
broad lensing kernel and projections of large scale struc- 
ture limit the accuracy to which radial positions can be 
determined. 

From the colors and sizes of the points in Figure ITTl it 
is clear that tomographic redshifts arc more accurate for 
clusters with a higher detection significance. In Figure lT^ 
we show scatter plots of redshift error Az versus detec- 
tion significance, which illustrates that the redshift errors 
taper down dramatically for more significant detections. 
If there were a candidate 'dark clump' corresponding to a 
very high peak in a likelihood map, the tomographic red- 
shift might provide the redshift of the cluster reasonably 
well, and would be the only means available to determine 
a redshift in the absence of a significant overdensity of 
galaxies. However, we saw in §3.2 that ~ 15 % of the 
most significant peaks were projections, so there would 
be no guarantee that the object was an actual cluster, 
although the tomographic redshift information could be 
taken into account when devising a strategy for follow 
up. 

Scatter plots of the redshift error Az versus mass and 
redshift are shown in Figure El and Figure El As ex- 
pected, the tomographic redshift errors decrease for more 



TABLE 5 
Tomography Tests 





Noisy 


Noiseless 


Filter 


|Az| < 0.2 




lAzI < 0.2 




NFW 9out 


= 3.0 


42% 


0.44 


67% 


0.36 


NFW Bout 


= 5.5 


45% 


0.41 


67% 


0.34 


NFW Sout 


= 10.0 


46% 


0.40 


67% 


0.35 


NFW 9out 


= 15.0 


47% 


0.39 


66% 


0.36 


Isothermal 


9out = 5.5 


46% 


0.40 


65% 


0.35 



NOTES. — Summary of tests on accuracy of tomographic redshifts. 
The column labeled indicates the filter used, and the left and right 
columns indicate the fraction of clusters with \Az\ < 0.2 and the 
variance of the tomographic redshifts ctaz for noisy and noiseless 
mock sources. 



massive clusters and for clusters closer to the peak of 
weak lensing kernel (~ 0.4). However, the likelihood of 
the detection is clearly a better indicator of the tomo- 
graphic redshift reliability than mass or redshift, since 
the strength of the lensing signal is determined by both 
the mass and redshift of a cluster. 

6.2. Tests of Tomography 

The truncation scale ^out in eqn. (|34|l that optimizes 
the number of clusters detected is not necessarily opti- 
mal for minimizing the error Az of tomographic redshifts 
Zd of the clusters. We applied the TMF to our sample 
of clusters with truncation scales 0out = 3.0', 10.0', 15.0' 
and found negligible improvement in the tomographic 
redshift errors. 

Finally, we quantify the degree to which tomographic 
redshifts depend on the assumed model cluster profile 
G{x) in eqn. H16|l . by computing tomographic redshifts 
for the same sample of clusters but using a different shear 
profile. We consider an isothermal sphere with a core, 
where the core radius is set to the same value of 6s — 0.50 
used for the NFW scale radius profile and we adopt a 
Gaussian truncation scale of ^out = 5.5. An isothermal 
sphere with a core has tangential shear profile 

1 + -x'^ 

^iso(a;) - (f^^' (36) 

(|Blandford fc Kochaneklll987f) with x^e/Os- This pro- 
file has an outer power law slope of ~ 0^^, different from 
the 6-'^ scaling of the NFW profile used with the TMF. 
Overall, the isothermal profile gives tomographic redshift 
errors similar to the NFW profile used in this section, al- 
though for a particular cluster the redshifts deduced can 
be different. 

Table [3 summarizes the results of this section on the 
reliability of tomographic redshifts, and its dependence 
on truncation scale and filter profile shape. 

7. DISCUSSION 

We have described a fast efficient simulation scheme 
for computing the statistics of shear selected galaxy clus- 
ters from weak lensing. By tiling the line of sight with 
32 unique PM simulations, our algorithm is unique in 
that it densely samples the primordial Gaussian distribu- 
tion of random phases of large scale structure along the 
light cone, providing an accurate representation of the 
statistics of rare events. We have confirmed that the two 
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Fig. 10. — Left: Scatter plot of likelihood ver- 
sus cluster mass and redshift for 1060 clusters de- 
tected with the TMF3 above the efficiency cutoff 
of 75%. The colors and sizes of the points reflect 
the detection significance of the cluster. Because 
of the redshift dependence of the lensing kernel, 
the likelihood is a function of cluster redshift in 
addition to mass. Right: The z — M plane of the 
scatter plot at left. 
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Fig. 11. — Tomographic redshift versus real redshift for noisy (left) and noiseless (right) data. The colors and sizes of the points reflect 
the detection significance of the cluster. The fraction of clusters with tomographic redshift errors |A^| < 0.2 is 45% and the root mean 
square deviation from the real redshifts is {{Az)^)^^^ = 0.41 (see Table IS! . 
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Fig. 12. — Tomographic rcdshift errors versus likelihood for noisy (left) and noiseless (right) data. The colors and sizes of the points 
reflect the detection significance of the cluster. 
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Fig. 13. — Tomographic redshift error versus cluster mass for noisy (left) and noiseless (right) data. The colors and sizes of the points 
reflect the detection significance of the cluster. 



in the light cone (Figure EJ are accurately reproduced. 
Although the dynamic range of our PM simulations is 
limited, we argued that high resolution simulations are 
not necessary to study shear selected cluster because of 
the small scale shot noise limit of weak lensing observa- 
tions. 

A maximum likelihood tomographic technique was pre- 
sented which utilizes the photometric redshifts of source 
galaxies to determine the radial position of a galaxy 
cluster from weak lensing alone. We introduced a to- 
mographic matched filtering (TMF) shceme, which opti- 
mally incorporates this additional radial information to 
detect more clusters. We applied the TMF to a large 
ensemble of simulations and found that it enhances the 
number of clusters detected with S/N > 4.5 by as much 



as 76% (see Table H and |H1). As illustrated by Fig- 
ure |S1 the TMF increases the dynamic range of weak 
lensing searches for clusters, detecting more high red- 
shift clusters and extending the mass sensitivity down 
to the scale of large groups. Furthermore, binning the 
sources coarsely using only three redshift bins was suffi- 
cient to reap the benefits of tomography. Thus the fil- 
tering scheme developed here can be applied to ground 
based weak lensing observations in as few as three bands, 
since two colors and a magnitude provide enough infor- 
mation to bin sources this crudely. 

For the case where the photometric redshifts of sources 
are known more precisely, as could be achieved with mul- 
ticolor ground based data, we quantified the errors in the 
tomographic redshifts. For the densities of sources used 
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Fig. 14. — Tomographic redshift error versus cluster redshift for noisy (left) and noiseless (right) data. The colors and sizes of the points 
reflect the detection significance of the cluster. The linear feature in the upper left corner of the plots are an artifact of a slight tendency 
for the tomography to map low signal no noise detections to = 0, hence Az = z. 



in this paper, 45% of clusters detected with signal to 
noise ratio S/N > 4.5 had tomographic redshift errors 
|Az| < 0.2 and the rms deviation from the real redshifts 
is a^z = 0.41, with smaller errors for higher S/N ratios. 

The most appealing property of shear selected clus- 
ter samples in comparison to other methods of detect- 
ing clusters, is that the expected cluster distributions can 
be reliably determined for any cosmological model using 
simulations like those described in this paper. Because 
on the scales of interest only gravity is involved, compar- 
ing theory to observations does not, in principle, depend 
on assumptions about the relationship between dark and 
luminous matter. However, in practice projection effects 
and incompleteness are likely will complicat the interpre- 
tation of shear selected samples. 

Weak lensing searches for clusters are plagued 
by projection effects. In particular, we found that 
the maximum intrinsic efficiency of weak lensing 
cluster surveys is ^ 85%. Phrased in terms of our 
criteria for matching peaks with clusters (§2.5), of 
order ~ 15% of the most significant peaks detected 
in noiseless weak lensing maps do not have a col- 
lapsed halo with M > 10^^-^ h''^ Mq within a 3' 
aperture. This intrinsi c inefficiencv, also noted bv previ- 
ous investigators llWhite. van Waerbeke. fc MackevI 

2002t [Padmanabhan. Seliak. fc PenI 120031: 

Hamana. Takad a. & Yoshida 2003), arises because 
of the broad weak lensing kernel and confusion from 
large scale structure fluctuations. 

The calculated maximum intrinsic efficiency is sen- 
sitive to parameters. Lowering the limiting mass, 
2Q13.5 f^-i y^Q^ of our cluster catalog, increasing the size 
of the aperture, 3', used to correlate peaks with halos, or 
reducing the linking length, b = 0.2, of the FOF group 
finder, would all increase the efhciencies deduced. Nev- 
ertheless, some of the most significant peaks in the mass 
maps do not correspond to actual clusters. While clus- 
ter finding in numerical simulations involves applying 



a group finding algorithm to 3-d dark matter particle 
distributions, observationally determining whether there 
is a galaxy cluster at the location of a given peak will 
depend on observational criteria like the overdensity of 
galaxies, X-ray flux, SZ decrement, or the presence of a 
brightest cluster galaxy. Thus, any criteria employed to 
distinguish the real clusters from the projections will rely 
on detecting baryons. Because our ability to detect these 
baryons is a function of mass and redshift, and because 
of the scatter between baryonic observables and the un- 
derlying mass of the dark matter halo, the projection 
effects in weak lensing searches will degrade the well de- 
fined selection function deduced from 'dark matter only' 
simulations like those employed in this work. 

In the context of using cluster counts to constrain cos- 
mological parameters, the uncertainties in the selection 
function caused by projections are exacerbated by the 
high incompleteness of shear selected cluster samples. 
Even for an efficiency cut of e > 60%, the completeness 
is > 50% only for the most massive clusters M > 10.0^"' '^ 
which are near the peak of the lensing kernel at z ~ 0.3, 
and lower everywhere else. The severe projection effects 
imply that these missed clusters are likely to scatter in 
or out of the detected sample. In light of the fact that 
this number of missed clusters is of the same order as the 
number detected, an accuracy of x% in the final cluster 
sample requires a theoretical understanding the selection 
function to better than ~ x%. This is likely to be very 
hard. One alternative is to throw away the number count 
information and use only the shape of the redshift distri- 
bution of shear-selected clusters to constrain cosmology 
since the shape will be much less sensitive to the precision 
to which the selection function is known. 

Cross correlation of shear selected cluster samples 
with overlapping optical, SZ, or X-ray cluster catalogs 
is an important cosmological application (Schirmer et 
al. 2003,2004; Hughes et al. 2004). For such a cross- 
correlation it is likely to be advantagous to work at lower 
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efSciency thresholds, considering that the number of clus- 
ters per square degree for an efficiency e > 60% is five 
times larger than that with e > 75% (see Table^)). These 
less significant peaks will span a larger mass and redshift 
range increasing the amount of overlap between the two 
cluster samples. Besides addressing probable baryonic 
biases for baryon selected clusters, cross correlation could 
be the most expedient route to precision cosmology with 
cluster surveys. In order to recover parameter sensitivity 
degraded by the scatter in the mass-observable relations, 
iMaiumdar fc M ohr ( 2003) advocate calibrating these re- 
lations with extensive follow up mass measurements of a 
fraction of the clusters in a 'baryon selected' sample. An 
alternative to such a follow up campaign of individual 
clusters is to conduct a parallel weak lensing survey, and 
use the clusters detected from both weak lensing and, 
say SZ, to determine the mass-observable relation statis- 
tically. Because the mass and redshift distributions of 
the shear selected clusters is known for any cosmological 
model, the distribution of dual weak lensing-SZ detec- 
tions can be used to simultaneously constrain cosmology 
and the relationship between mass and SZ decrement. In 
addition to the statistical mass-SZ decrement calibration 
provided by the distribution of dual detections, stacked 
weak lensing mass measurements of the individual SZ 
clusters can be used to provide another constraint on this 
relation. Finally, it is worth emphasizing that an op- 
tical cluster search JPqstman et al.lll996t iKeoner et al.l 
[19991 IGladders fc Yed nmH ^^hefcKoch£me^l2003 
IKochanek et al.*2003^ could be conducted using the same 
deep imag ing data us ed to m easure weak lensing. Indeed, 
iSchirmer et alJ l|2b0ii . l200^ have already combined shear 
selection with optical color selection, and confirmed sev- 
eral color-selected optical cluster candidates with weak 



lensing. Such parallel analyses of shear-selected cluster 
samples and baryonic cluster samples will provide mass 
calibration, help break parameter degeneracies, and elim- 
inate systematics inherent in cluster surveys. 

The potential of shear selected clusters to provide pre- 
cision constraints on the dark energy, either alone or 
in conjunction with another baryonic cluster survey, are 
topics which merit future investigation. 
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